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Abstract 


\  ~ 

This  report  deals  with  the  way  a  network  of  decision  makers  can  control 
an  outside  environment.  The  environment  considered  Is  the  Ballistic  Missile 
Defense  where  defense  resources  are  to  be  allocated.  The  allocation  problem 
Is  one  of  interceptors  against  an  attack.  The  network  of  decision  makers  is 
described  by  an  arbitrary  connection  matrix.  A  decentralized  information 
pattern  Is  considered:  each  decision  maker  Is  a  node  In  the  network  and  has 
access  only  to  the  Information  of  the  nodes  it  is  connected  to.  A  decision 
consists  of  the  allocation  of  Its  defense  resources  to  own  defense  and  to 
support  the  neighbors.  In  the  dynamic  case  part  of  the  available  resources 
have  to  be  put  aside  for  future  use.  The  future  attack  against  which  these 
resources  will  have  to  be  used  is  known  only  via  a  probabilistic  description. 
All  these  decisions  have  to  be  made  using  the  decentralized  Information 
available  to  each  decision  maker.  The  framework  of  solution  for  this  problem 
Is  within  what  is  known  as  team  theory.  Three  decentralized  allocations  are 
proposed  and  compared  via  simulations  with  a  local  strategy  (no  exchange  of 
Information)  and  the  fully  centralized  strategy.  The  results  show  that  the 
most  sophisticated  decentralized  strategy  can  achieve  a  performance  near  the 
one  of  the  centralized.  The  Implications  of  these  results  on  the  design  of 
a  computer  network  to  carry  out  these  functions  are  also  discussed. 


Introduction 


The  problem  of  real-time  allocation  and  control  technology  for  the  BMD 
environment  was  dealt  with  extensively  in  [Rl].  The  desirability  of  using 
Distributed  Computer  Systems/Computer  Networks  (DCS/CN)  was  pointed  out.  Some 
of  the  reasons  for  this  approach  are:  reduced  vulnerability,  more  uniform 
computation  load,  less  reporting  and  communication  problems.  The  achievement  of 
these  goals  has  been  made  feasible  by  the  recent  advances  in  microcomputers.  On 
the  other  hand  such  systems  are  harder  to  analyze  and  their  performance  is  usually 
lower  when  compared  to  a  fully  centralized  system.  Nevertheless,  as  it  is  shown 
in  the  sequel,  a  properly  decentralized  control  can  achieve  a  performance  very 
near  to  a  fully  cnetrallzed  one. 

There  are  two  principal  aspects  of  Distributed  Computing  Systems: 

(i)  The  Internal  structure  Interconnection,  switching  problems,  operating 
systems,  reliability,  architecture,  distributed  data  bases,  etc.) 

(11)  The  use  of  the  DCS/CN  to  control  are  outside  environment. 

Our  philosophy  is  that  item  (11)  is  the  ultimate  goal  of  the  system  and  it 
is  on  this  area  that  our  research  effort  has  focused.  On  the  other  hand,  the 
tool  to  carry  out  (ii)  is  the  DCS/CN.  Thus  the  two  items  cannot  be  really 
separated,  as  pointed  out  in  IRI]:  research  on  both  aspects  should  be  carried 
out  and  unified. 

This  report  presents  the  results  of  our  research  on  (11)  and  points  out  how 
the  control  technology  has  reached  the  stage  to  be  connected  to  the  computer  net¬ 
work.  This  connection  is  discussed  further  in  Section  11. 

The  goal  of  our  research  was  to  understand  how  a  network  of  decision  makers 
(in  practice  a  computer  network)  can  be  used  to  control  a  BMD  environment  by 
carrying  out  the  allocation  of  defense  resources. 


This  report  presents  a  mathematical  formulation  of  a  Ballistic  Missile 
Defense  resource  allocation  problem.  The  motivation  of  this  work  stems  from 
the  desirability  of  using  a  computer  network  for  the  real  time  decisions  In 
the  BMD  problem.  The  focus  of  the  report  is  on  how  to  use  such  a  network, 
represented  here  by  a  set  of  decision  makers,  to  solve  a  control  problem  in  a 
dynamic  stochastic  environment,  l.e.,  how  a  network  should  control  an  outside 
environment.  This  is  different  from  the  problem  of  how  the  network  should  be 
controlled. 

The  specific  problem  to  be  discussed  is  the  allocation  of  Interceptors 
against  an  attack.  The  allocation  decisions  are  made  by  a  set  of  decision 
makers.  Following  the  study  reported  In  [Rl]  which  pointed  out  the  desirability 
of  a  decentrallzed/dlstrlbuted  approach,  the  decision  process  considered  is 
of  the  decentralized  type.  Mathematically,  this  translates  Into  the  fact  that 
the  information  pattern  of  the  set  of  decision  makers  is  "nonnested":  there  is 
no  ordering  of  the  decision  makers  such  that  each  knows  at  l^ast  as  much  as  its 
predecessor.  A  properly  designed  decentralized  decision  making  system  can  be 
close  to  a  centralized  system  in  terms  of  performance.  The  present  report 
deals  with  the  mathematical  formulation  of  this  problem  In  the  context  of  team 
theory  [Ml].  Another  Important  aspect  of  a  decentralized  control  is  the  so- 
called  authority  associated  with  each  decision  ^ker:  the  set  of  allowable 
decisions  for  a  DM.  From  the  control  point  of  view,  the  authority  and  infor¬ 
mation  pattern  define  the  framework  of  the  defense's  optimization  problem. 

The  attack  side  will  have  to  be  described  in  a  suitable  form  such  that  a  well- 
defined  performance  index  emerges  that  can  then  be  optimized  by  the  defense 


within  its  constraints 
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The  team  theory  approach  to  the  3MD  resource  allocation  problem  Is 
presented  in  Section  3,  together  with  Its  limitations.  While  this  approach  Is 
useful  as  a  framework.  Its  limitations  suggest  that  suboptlmal  solutions  have 
to  be  sought.  The  type  of  suboptlmal  solution  that  Is  developed  is  of  the 
"member-by-member  optimal"  type,  which.  In  our  case  does  not  guarantee  global 
optimality.  This  approach  is  then  used  on  a  more  general  problem  formulation 
than  first  presented.  This  more  general  performance  index  Is  presented  In 
Section  4  In  the  context  of  an  arbitrarily  connected  network  of  decision  makers. 
A  first  and  very  simple  decentralized  allocation  strategy  Is  described  In 
Section  S.  Optimization  Is  carried  out  within  a  class  of  decision  functions 
considered.  An  arbitrarily  connected  network  of  decision  making  stations  is 
considered  with  each  making  a  decision  based  upon  Its  available  Information. 

The  Information  available  to  each  decision  maker  consists  of  local  Information 
as  well  as  the  Information  obtained  from  the  stations  to  which  It  has  a 
communication  link.  The  decision  by  a  station  affects  the  performance  of  that 
station  as  well  as  that  of  stations  It  Is  connected  to.  Two  more  sophisticated 
decentralized  strategies  are  discussed  In  Sections  6  and  7.  These  latter 
strategies  have  the  feature  of  preferential  allocation.  Numerical  results 
Indicate  that  the  most  sophisticated  decentralized  strategy  yields  performance 
close  to  fully  centralized  one.  The  extension  to  the  dynamic  situation  Is 
presented  In  Sections  8  and  9  with,  an  example  In  section  10.  Section  11 
presents  the  conclusions  and  recommendations  for  future  work. 


Ik 


The  following  resource  allocation  problem  Is  considered.  The  context  is 


terminal  defense  where  a  set  of  N  defended  points  (silos)  Indexed  1*=1,...,N  is 
assumed  to  be  attacked  by  a  wave  of  reentry  vehicles  (RVs).  The  attack  against 
silo  1  Is  by  RVs.  The  overall  attack  is  described  by  the  N-vector  and  this 
will  be  assumed  to  be  a  random  variable  with  a  known  Joint  distribution.  Each 
silo's  defense  is  decided  upon  by  a  local  decision  maker.  At  the  disposal  of  the 
1-th  such  decision  maker  there  are  r^  units  of  defense  resources  (interceptors). 

A  decentralized  decision  making  with  the  following  characteristics  Is 
assumed: 

1.  Authority:  each  local  decision  maker  (DM)  can  allocate  Interceptors 
for  Its  own  defense  or  its  neighbors'  (defined  for  silo  i  as  1-1  and 
1  +  1). 

2.  Information  pattern:  the  1-th  decision  maker  DM^  knows 

I.  =  {r  .  o  ;  k  =  i-l>  it  1  +1}  (2.1) 

1  k  k 


l.e.,  only  the  situation  of  its  neighbors  in  addition  to  its  own. 

This  means  that  each  DM  observes  a  subset  of  the  components  of  the 
random  variable  a  (the  attack). 

The  probability  of  survival  of  silo  1  in  face  of  the  j-th  RV  attacking  It  i 

if  no  interceptor  Is  allocated  against  It  (a^^  >  0)  is 

^  ^ ’’o  «.2) 

where  p  Is  the  reliability  of  an  attacking  RV.  Denoting  by  p.  the  reliability 

<X  V 

of  an  interceptor  the  probability  of  survival  of  a  defended  silo  in  face  of 
RV  J  is 

Sy  (a^j-l)  -  1  -  p^(l-p^)  ^  p^  (2.3) 

Thu8»  for  a  binary  allocation  a^^  one  can  write 

The  probability  of  survival  of  silo  i  conditioned  on  being  attacked  by 
a^  RVs  and  defended  by  interceptors  is 


III  <11-1  klip 


8 


local  information 
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The  Team  Theory  Approach  to  Solution 

The  stochastic  optimization  problem  formulated  In  the  previous  section,  as 
summarized  by  (2.9),  falls  In  the  category  of  static  team  problems.  However, 
despite  Its  being  a  rather  simplified  BMD  problem.  It  Is  substantially  more 
complex  than  the  team  problems  whose  solutions  are  known  [HI,  Ml]. 

The  general  form  of  static  team  problems  is  as  follows.  The  performance 
Index  Is 


max  E{S(dj^(Ij^),  ....  j^(I^),o]} 
iL 


(3.1) 


where 


(3.2) 


The  raember-by-member  optimality  conditions  are 

max  e|e{S[^^  (Ij^),  ...,  1^' 

*  * 


4 


4^4^*  4+i<4+i^‘  •••’4  (3-3) 

* 

where  d^  ,  jj*l,  are  fixed  decision  rules.  The  above  is  equivalent  to 

max  E{S[d  *(I  ) . n). 

.(IJ  ^  ^ 

4^4^ »  4+1  ^4+1^ . 4 


4'‘i' 


The  main  result  pertaining  to  this  probelm,  which  Is  due  to  Radner  [MlJ 
states  that  If  the  function  S  Is  differentiable  and  concave  In  ^  then  the  member- 
by-member  optimality  conditions  give  a  unique  solution  which  Is  the  optimum 
for  the  overall  probelm.  The  differentiability  requirement  requires  implicitly 
that  the  decision  variable  be  continuous. 

Note  that  in  order  to  carry  out  the  maximization  In  (3.3)  to  obtain  the 
optimum  decision  for  DM^  one  needs  to  know  the  forms  of  the  other  members' 
optimal  decisions,  t.e.,  the  functions 


,1 


I 

I 

I 

I 

I 

I 

I 

I 

I 

I 
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j  -  1 . i-1,  1+1,..., N}  (3.5) 

and  the  conditional  densities 

{p(Ij|l^),  u  =  1 . 1-1,  1+1 . N}  (3.6) 

are  needed  to  carry  out  the  expectation  in  (3.3).  In  other  words,  each  decision 
maker  must  have  a  picture  of  what  the  others  are  doing  and  based  on  what  inform¬ 
ation  they  act. 

In  the  case  where  the  performance  Index  is  quadratic  in  the  decision 
variables  and  the  information  available  to  each  DM  is  a  subset  of  the  random 
vector  Cl,  assumed  normally  distributed,  the  conditions  of  Radner  are  satisfied. 
The  optimal  solution  for  each  decision  variable  is  then  linear  in  its  observa¬ 
tion.  The  key  requirement  (3.5)  that  each  DM  know  the  form  of  the  decision  rule 
of  all  the  other  DMs  is  satisfied  due  to  the  linearity  of  the  solution.  The 
conditional  densities  (3.6)  are  all  normal  and  they  follow  from  the  density 
of  the  vector  £. 

The  main  differences  between  this  problem  and  the  BMD  formulation  described 
in  the  previous  section  are: 

1)  The  decision  variables  are  discrete 

2)  The  random  vector  £  is  discrete-valued 

3)  The  performance  index  (2.15)  requires  a  maximization  of  a  sum  of  functions 
which  are  convex  (rather  than  concave,  which  is  the  more  desired  case 

in  maximizations). 

The  implications  of  the  above  properties  of  our  problem  are  that  new, 
discrete  optimization  techniques  will  have  to  be  used.  Furthermore,  the  need  to 
know  the  forms  of  the  decisions  (3.5)  cannot  be  satisfied  as  easily  as  in  the 
linear-quadratlc-gausslan  case  because  the  decision  rules  cannot  be  linear. 

A  second  problem  is  the  specification  of  the  conditional  probability  densities 
(or  rather,  probability  functions)  for  the  pertinent  random  variables. 
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The  main  difficulty  Is  posed  by  the  nonnestedness  of  the  Information  sets, 
which  precludes  the  use  of  the  Principle  of  Optimality.  This  follows  from  the 
fact  the  interleaved  (nested)  extremlzatlons  and  expectations  In  the  stochastic 
version  of  the  Principle  of  Optimality  require  nested  information  sets  (e.g.. 
Bellman  [Bl],  Bar-Shalom  and  Tse  [B2]).  In  contradistinction  with  this,  the 
team  theory  problem  formulation  does  incorporate  the  informational  constraint  of 
our  problem.  However,  this  theory  has  provided  an  explicit  solution  only  to  the 
Llnear-Quadratlc-Gausslan  Problem  (see  McGuire  and  Radner  [Ml],  Ho  and  Chu  [Hi]). 
The  two  properties  required  from  the  cost  function  that  were  crucial  in  obtaining 
the  solution  to  the  Llnear-Quadratic-Gaussian  team  problems  are  concavity  (for 
maximization;  convexity  for  minimization)  and  differentiability.  These  two 
properties  led  to  the  equivalence  of  the  global  optimization  problem  to  the 
so-called  member-by-member  optimization  problem. 

The  unique  aspects  of  the  Ballistic  Missile  Defense  problem  pointed  out  in 
[Rl]  put  It  in  a  class  by  itself  and  our  work  had  as  starting  point  this  very 
important  observation.  The  preliminary  problem  formulation  for  a  Ballistic 
Missile  Defense  Resource  Allocation  Problem  presented  in  Section  2  arrived  at 
a  performance  index  to  be  maximized;  this  performance  index  consisted,  however, 
of  a  sum  of  convex,  rather  than  concave  functions  of  the  resources.  It  seems 
that  proof  of  the  global  optimality  of  the  member-by-member  optimization  Is 
not  feasible. 

The  next  section  presents  a  performance  Index  that  Is  more  general  than  the 
one  developed  In  Section  2.  This  generalized  performance  Is  more  flexible  than 
the  earlier  one  and  is  used  subsequently  in  the  simulations  of  the  various 
decentralized  allocation  strategies  presented  later. 
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4,  A  Generalized  Perforiaance  Index 

There  are  N  stations  which  experience  a  "demand"  (e.g.  attack)  of  units, 

1“1 . N.  The  vector 

a  =  . OjjJ'  C4.1) 

Is  a  random  variable  with  a  known  joint  probability  distribution  function.  Each 
station  Is  a  decision  maker  at  whose  disposal  there  are  r^  ,  1=1,..., N  resource  units 
to  satisfy  the  demand  (e.g.  defense  units  that  counter  the  attack). 

A  syionetrlc  connectivity  matrix 

(4.2) 

with  binary  elements  is  defined.  The  unity  elements  indicate  which  stations  are 
connected.  The  diagonal  elements  are  unity  by  definition.  Connected  stations  (and 
only  those)  exchange  Information  about  their  available  resources  and  the  demand  they 
face  -*  this  Is  done  with  the  purpose  of  allowing  a  decision  maker  to  assign  some  of 
Its  own  resources  to  "help"  the  neighbors.  The  set-up  considered  allows  a  network 
of  stations  with  arbitrary  connections. 

The  Information  structure  at  station  1  Is 


■  ‘'ic-  “k'  “l’ 


«.3) 


where  Is  the  set  of  stations  connected  to  1 

Cl  -  (J  !  -  1}  (4.4) 

The  performance  Index  (survlvavlbillty  of  a  station)  Is  taken  as 

V“i>V 


as  in  Section  2  with  the  notations 
Pi  -  1  -  Pe,  (l-Pfi) 


p  ■  1  -  p 
*^o  a 


(4.6) 

C4,71 


where  p  Is  the  reliability  of  an  attacker  unit  and  p.  the  reliability  of  a  defense 


resource  unit.  Term  (4.6)  is  the  probability  of  survival  of  a  defended  silo  while 


(4.7)  is  the  probability  of  survival  of  an  undefended  silo,  both  in  face  of  a  single 


attacker.  We  shall  allow  use  of  more  defense  resources  than  the  number  of  attacker 


unite,  in  which  case 


The  performance  index  given  by  the  piecewise  analytical  function  described  by 


(4.5)  and  (4.8)  is  plotted  for  the  purpose  of  illustration  in  Fig.  4.1.  The  first 


part  of  the  curve  is  convex  (too  little  resources  are  of  very  small  use)  while  the 


second  part  is  concave  (diminishing  marginal  returns)  in  i  when 


where  d . ,  is  the  allocation  from  i  to  J  (only  if  connected).  Then 


and  the  total  resource  constraint  is 


The  problem  is  then  for  each  decision  maker  to  obtain  an  allocation  of  defense 


units  as  indicated  by  (4.10),  l.e 


-nr  ||  iHi  Tuimiiiii  1  m  -  ,1  f  ■  1  |i  m 


(ii)  for  those  stations  It  is  connected  to  subject  to  the  constraints  (4«12)| 
such  as  to  maximize  the  expected  number  of  surviving  stations  f  or ,  equivalently 
the  average  survivability*  This  latter  form  of  the  performance  is  considered 
further  in  the  next  section  where  a  decentralized  decision  making  algorithm  is 


Figure  4.1  Performance  indes  for 


A  parametric-optimization  decentralized  algorithm  and  simulation  results 


A  problem  with  M'-b  stations  was  considered  with  the  performance  Index  for  each 


station  given  by  the  function  plotted  In  Fig. 4.1.  The  global  objective  function  was 


the  average  survivability 


where  the  expectation  Is  over  all  the  possible  attack  scenarios 


The  following  connection  matrix  between  the  stations  was  assumed 


The  corresponding  network  is  depicted  In  Fig 


Fig.  5.1  Network  of  stations  In  example 


A  variable,  called  threat  level, has  been  defined  as  follows 


Through  the  communication  network  described  by  (5.2)  each  station  knows  the 


threat  level  of  all  the  stations  It  Is  connected  to.  The  following  decision  rule 


Then  the  problem  Is  to  optimize  over  the  parameter  x  called  the  transfer  factor 


The  total  resources  available  to  station  1  for  Its  use  are  then 


Example  1.  The  available  defense  resources  were 


The  nonlnteger  allocation  variables  resulting  from  (5.4),  (5.5)  were  rounded  to 


the  nearest  Integer  below  and  the  remainder  distributed  according  to  decreasing 
magnitudes.  The  overall  performance  index  (5.1)  Is  plotted  as  function  of  the  transfer 
factor  X  in  Fig.  5.2.  The  nonuniqueness  (plateaus)  of  the  optimal  solution  which  Is 
in  this  case  xe[.082,.l]  follows  from  the  Integer  constraint  Imposed  on  the  decision 
variables.  Also  note  the  appearance  of  local  extrema  **  this  is  a  phenomenon  typical 


17 


I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 


to  nonlinear  stochastic  control  problems  as  pointed  out  In  Bar-Shalom  and  Wall  (B4]. 

The  three  equlprobable  scenarios  considered  In  this  first  example  were  as  follows: 
Scenario  1: 

Attack  *  (4  2  3  5  3  4),  worst  case 

Optimum  Average  Survivability  0. 7511 
Alloratlon  matrix  from  station  1  to  j: 

1  2  3  4  5  6 

1  4  0  0  0  0  0 

2  0  2  0  0  0  0 

3  0  0  3  0  0  0 

4  0  0  1  4  0  0 

5  0  0  0  0  3  0 

6  0  0  0  0  0  4 

I  i  I  I  I  i 

4  2  4  4  3  4 

vs. 

4  2  3  5  3  4 

Exchanges  between  stations: 


Scenario  2; 

Attack  -  (2  2  8  5  2  2),  best  case 

Optimum  Average  Survivability  «  0.7586 
Allocation  matrix  from  station  1  to  J: 


Optloum  Average  Survivability  ■  0. 7560 


Allocatioa  matrix  from  station  i  to  J 
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Exchange  between  stations: 


A  major  Issue  Is  where  does  the  proposed  scheme  stand  In  terms  of  Its  performance 
with  respect  to  the  following  alternative  strategies 

(I)  Local  -  each  decision  maker  Is  using  strictly  local  Information,  l.e.,  no 
communication  between  the  stations  and,  therefore,  no  resources  can  be 
exchanged 

(II)  Fully  centralized  -  all  the  Information  Is  available  at  a  single  location 
where  a  global  allocation  of  all  the  available  resources  Is  carried  out. 

The  local  scheme  described  In  (1)  above  will  yield  a  lower  bound  of  the  perfor¬ 
mance  of  the  decentralized  algorithm.  On  the  other  hand,  a  fully  centralized  algorithm 
will  yield  an  upper  bound.  Despite  the  fact  that  this  upper  bound  Is  not  achievable, 
a  major  question  of  Interest  pointed  out  In  [Rl]  Is  how  close  the  decentralized 
algorithm's  performance  can  be  to  the  one  of  the  centralized  algorithm.  From  the 
point  of  view  of  robustness  or  cost  of  Implementation  one  would  favor  a  decentralised 
algorithm  [Rl],  even  though  quantitative  studies  have  not  yet  been  made  In  this  area 
to  obtain  measures  of  robustness. 

The  performance  of  the  local  decision  scheme  can  be  obtained  from  Fig.  5.2  -  it 

corresponds  to  no  exchanges  of  defense  resources  between  the  stations,  l.e.,  x>o. 

* 

The  result  Is  J,  -  0.745  (L  «  local)  and  Is  lower  than  J_  - 


0.755  (DC  -  decentralised) 
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even  though,  the  difference  is  very  small. 

The  fully  centralized  scheme  will  completely  redistribute  the  total  amount  of 
resources  in  an  optimal  way  to  meet  the  demand  (counter  the  attack).  Uote  that  this 
set-up  assumes  not  only  communication  from  each  station  to  the  central  decision  maker 
but  also  the  Implicit  assumption  that  any  resource  can  be  used  anywhere  in  the  net¬ 
work  without  limitations.  The  exact  solution  to  the  fully  centralized  problem  is  a 
general  integer  programming  algorithm,  so  a  simple  Iterative  approach  has  been  adopted. 
The  original  assignment  (5.7)  was  taken  as  initial  allocation.  An  iteration  scheme 
consisting  of  switching  units  of  resources  to  obtain  the  largest  improvement  was  then 
carried  out.  This  was  done  subject  to  the  nonnegativity  constraint.  Such  a  procedure 
will  always  converge  and  lead  to  a  locally  optimal  allocation.  The  final  solution  will 
be  the  global  optimum  if  the  individual  performance  indices  are  concave.  Using  this 


procedure  the  average  survivability  with  the  centralized  decision  scheme  in  Example 
1  was 

Scenario  1:  J  ~  0.807 
Scenario  2:  J  -  0.813 
Scenario  3:  J  ~  0.813 

In  all  three  scenarios  the  centralized  allocation  turned  out  to  be  an  exact  match  of 
the  attack  pattern.  This  was  possible  because  the  total  amount  of  defense  resources 
was  assumed  the  same  as  the  total  number  of  attackers. 

Thus  the  average  survivability  over  the  three  scenarios  was  >  0.811  (C  - 
centralized).  As  expected  one  has  the  ordering 


(5.8) 


Nevertheless  in  this  example  the  sensitivity  of  the  performance  is  not  very  high.  In 
view  of  this  a  different  case  is  considered  next. 


! 


in  this  case  the  available  resources  are 
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d-  [4  2  3  5  1  11  (5.9) 

while  the  attack  scenarios  are  (42  33  3  4),  (2  2  852  2)  and  (225 

8  2  2),  all  equlprobable.  Note  that,  unlike  in  Example  1,  here  the  total  defense 
resources  are  only  16  vs.  attack  size  21.  It  Is  expected  that  in  the  case  where  the 
defense  is  outnumbered  the  sensitivity  of  the  performance  to  the  defense  strategy 
is  higher. 

For  the  local  strategy  the  performance  was: 

Scenario  1:  J  >  0.573 

Scenario  2:  J  ••  0.561 

Scenario  3:  J  ~  0.467 

which  yields  “  0.534 

For  the  decentralized  strategy  specified  earlier  in  this  section  the  average 

survivability  is  plotted  in  Fig.  5.3.  The  optimum  decentralized  performance  turned 

out  to  be  J  =  0.58.  The  optimum  x  is  in  the  interval  [0.181,  0.2]. 

oc 

The  shifting  of  defense  units  by  the  decentralized  algorithm  was  as  follows 
Scenario  1:  J  0.540 


Scenario  2: 


J  =>  0,557 


Scenario  3;  J  *  0.644 


The  completely  centralized  strategy  yielded  the  following  results 
Scenario  1:  J  ■  0.686 

Allocation:  (423034) 

W8.  Attack:  (423534) 

The  fourth  site  Is  given  up. 

Scenario  2:  J  *  0.753 

Allocation:  (330622) 
vs.  Attack:  (228522) 


The  third  site  Is  given  up 


Allocation 


The  fourth  site  is  given  up 


The  average  survivability  with  the  centralized  strategy  was  in  this  case  >  0.731 


Coi^paring  the  three  strategies  it  can  be  seen  that  in  this  example  the  decen¬ 


tralized  outperformed  the  local  by  about  10%  while  the  centralized  stragy  had  a 


performance  about  25%  higher  than  the  decentralized  one.  The  reason  that  the  cen¬ 


tralized  strategy  was  substantially  better  than  the  others  is  that  it  adopted  a 


perferential  defense  mode  -  this  is  a  consequence  of  the  availability  of  global  infor¬ 


mation  at  the  location  of  the  central  decision  maker 
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Decentralized  strategies  with  preferential  allocation. 

This  section  presents  the  description  of  a  defense  strategy  that,  when 
outnumbered  by  the  offense,  should  be  able  to  adopt  a  preferential  mode  according  to 
some  priorities  to  be  computed  in  real  time. 

As  the  previous  section's  computation  results  indicated,  the  partially  decentral¬ 
ized  strategy  based  on  the  balanced  threat  level  (we  call  It  strategy  Is  slightly 
better  than  the  Isolated  strategy.  Somehow  it  still  lacks  the  feature  to  do  the 
defense  preferentially  when  the  total  attack  Is  more  than  the  total  available  defense 
resource, 1. e. ,  there  is  a  need  to  give  up  certain  sites  so  that  other  sites  may  have 
more  resources. 

He  would  like  to  define  a  decentralized  strategy  with  which  the  decision  node 
will  be  able  to  carry  out  assignment  of  priorities  to  the  nodes  within  Its  information 
domain  (and  support  domain).  The  priority  order  in  each  domain  should  be  determined 
based  on  the  total  available  resource  in  the  domain,  and  on  the  attack  pattern  the 
entire  domain  is  facing.  Hopefully,  the  mechanism  to  generate  these  priority  orders 
should  guarantee  the  consistency  of  these  orders  between  any  two  overlapping  domains, 
at  least  approximately.  The  results  of  the  priority  order  should  not  be  known  to 
the  opponent,  and  the  opponent  should  be  prevented  from  being  able  to  generate  such 
information  himself.  Therefore,  for  real  implementation,  some  sort  of  random  coding 
known  only  to  the  defense  nodes  is  necessary  to  generate  such  priority  orders. 

The  following  is  a  scheme  of  decentralized  strategy  with  priority. 

Strategy  PI: 

For  each  decision  node  1,  collect  Information  from  its  domain,  (Domain  i),  and 
do  the  following: 

1.  From  the  attack  pattern  on  its  domain,  and  the  total  available  defense 
resource  within  the  domain,  compute  a  priority  order  for  the  nodes  in  the 
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domain  according  Co  some  rule. 

2.  Determine  a  desirable  defense  pattern  for  all  the  nodes  In  Che  domain  by 
redistributing  Che  total  resources  In  the  domain.  The  rule  vrlll  be  detailed 
later.  Let  Che  desirable  resource  for  node  j  as  viewed  by  node  1  be  y(j,l). 

3.  Compare  y(l,l)  with  the  actual  resource  that  node  1  has.  If  y(l,l)  Is 

siiLsller,  node  1  does  nothing  but  wait  for,  hopefully,  support  from  Its 
neighbors . 

4.  If  y(i,l)  Is  larger  than  what  node  1  actually  has,  It  will  send  away  those 
extra  units  of  resources  to  support  certain  nodes  In  Domain  1.  Those  to  be 
supported  are  the  ones  that  have  their  respective  y(j,l)  more  Chan  their 
actual  assignments.  Such  support  transfers  are  made  proportional  Co  the 
deflcency  In  resource.  Example:  Let  the  actual  resource  in  Domain  1  be 
(4511),  and  the  desirable  defense  allocation  as  viewed  by  node  1  is 
(1235).  Node  1  will  send  3  units  away  to  nodes  3  and  4,  because  the 
latter  two  have  2  and  4  units  in  deficiency  respectively.  One  unit  will 

be  sent  to  node  3,  and  two  units  to  node  4  (1:2>2:4,  l+2«3).  (Convert  to 
integer  solutions  where  needed) 

5.  The  priority  order  in  a  domain  will  be  tentatively  decided  in  the  reverse 
order  of  the  amount  of  attack  In  the  domain.  Namely,  the  node  which  has 
the  minimum  amount  of  attack  gets  the  highest  priority,  etc. 

6.  A  tentative  scheme  to  decide  on  the  desirable  defense  pattern  given  the 
attack  pattern  and  the  priority  order  is  the  following:  From  the  total 
available  resource  in  Domain  1,  assign  as  many  as  available  and  to  the 
extent  of  the  amount  of  the  attacks,  to  the  sites  according  to  the  priority 
order.  Example:  There  are  totally  14  units  of  defense  resource  In  Domain  1. 
The  attack  pattern  is  (1  2  3  9  11).  Then  the  desirable  defense  pattern  Is 
(12350).  When  the  total  amount  of  resources  Is  more  than  the  total 
attack,  allocate  proportionally  according  to  the  attacks. 


Strategy  PI  described  in  Section  6  is  applied  to  Example  2  from  Section  5 


To  Illustrate  this  strategy,  consider  site  4  with  scenario  1.  This  site  Is 


connected  to  3,  5  and  6,  as  Illustrated  below 


The  defense  available  In  this  4th  domain  is  (3511),  which  la  a  total 


of  10,  while  the  attack  Is  (3534).  The  priorities  In  domain  4  are.  In  the 


order  of  Increasing  attack,  as  follows:  site  3,  site  5,  site  6,  site  4,  Thus 


site  4  reallocates  the  total  of  10  defense  resources  according  to  this  priority 


order:  site  3  gets  3  units,  site  5  gets  3  units,  site  6  gets  4  units  (which 


total  already  to  10)  and  site  4  gets  nothing.  The  desirable  allocation 


pattern  In  domain  4  Is  then.  In  view  of  what  DM.  knows,  (30  34),  compared  to 


tha  Initial  allocation  of  (3511).  Thus  site  4  will  deliver  2  units  to  site 


5  and  3  units  to  site  6 


All  the  DM's  carry  out  a  similar  assessment  as  above.  The  results  are 
summarized  below  In  Tables  7.1  -  7.3. 


for  site 


as  viewed 


Action 


Attack 


Initial  allocatlo 


Final  allocation 


Table  7.1  Desirable  allocations  as  viewed  by  the  DMs  with  strategy  PI 
and  the  resulting  allocations  for  Scenario  1. 


The  fourth  row  In  Table  7.1  Is  the  one  discussed  In  detail  earlier. 

Note  the  discrepancy  between  the  desirable  allocations  for  site  4  as  viewed  by 
the  various  DIfe  -  this  is  a  consequence  of  the  decentralized  Information 
pattern.  The  final  allocation  Is  thus  (423034)  for  which  the  performance 
is  J  ■  0.686. 


Table  7.2  shows  the  desirable  allocations  and  the  corresponding  actions  by 


the  DMs  for  scenario  2.  The  performance  In  this  case  Is  J  »  0.550. 
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The  second,  more  sophisticated  decentralized  strategy  PO,  also  described 
earlier,  is  considered  next.  Instead  of  prioritizing  the  sites  within  a 
domain,  an  Integer  programming  type  optimization  Is  carried  out  by  each  DM 
for  allocation  within  its  domain. 


The  result  for  Scenario  1  was  the  same  as  for  strategy  PI,  which  Is  shown 
in  Table  7.1,  and  will  not  be  repeated.  For  the  other  two  scenarios  the  allo¬ 
cations  were  different  and  the  performance  better. 

The  performances  were  for  scenarios  2  and  3,  respectively,  J  «  0.591 
(vs.  0.550)  and  J  0.750  (vs.  0.636).  The  detailed  results  are  presented 
in  Tables  7 .A  and  7 .5. 


for  site 


as  viewed 


Action  by  DM 


(Expects  help) 


(Expects  help) 


Send  3  to  #4 


(Expects  help) 


(Expects  help) 


(Expects  help) 


Final  allocation 


Table  7 . 4  Desirable  allocations  as  viewed  by  the  DMs  with  strategy  PO  and 


the  resulting  allocations  for  Scenario  2. 


I 

I 

I 

I 

I 


^'"'■vlor  site 

as  vleweo'’''^.^ 
by  DM 

1 

2 

3 

4 

5 

6 

Action  by  DM 

1 

3 

4 

Send  1  to  #3 

2 

3 

2 

(Expects  help) 

3 

3 

3 

8 

0 

(Expects  help) 

A 

6 

0 

2 

2 

Send  3  to  #3; 

1  to  //5;  1  to  #6 

5 

0 

4 

3 

(Expects  help) 

6 

0 

3 

4 

(Expects  help) 

Attack 

2 

2 

5 

8 

2 

2 

Initial  allocation 

4 

2 

3 

5 

1 

1 

Final  allocation 

3 

2 

7 

0 

2 

2 

Table  Z5  Desirable  allocations  as  velwed  by  the  DMs  with  strategy  PO 
and  the  resulting  allocations  for  Scenario  2. 

A  comparison  of  the  average  performances  for  the  strategies  considered  Is 
presented  In  Table  7.6 


Strategy 

Expected  Performance 

Completely  decentralised 

0.534 

Strategy  T 

(transfer  factor) 

0.580 

Strategy  PI  (Preferential 
with  priorities) 

0.624 

Strategy  PO  (Preferential 
with  local  optimisation) 

0.676 

Fully  centralised 

0.731 

Table  7 , 6  Comparison  of  the  various  strategies. 


The  local  optimisation  approach  taken  by  strategy  PO  appears  to  pay  off.  Th 
result  Is  quite  close  to  the  upper  bound  given  by  the  centralised  strategy. 


Resource  Allocation  In  a  Dynamic  Environment 


The  problem  of  extending  the  preferential  strategies  to  the  multistage 


The  network  consists  of  N  nodes  connected  according  to  a  (symmetric) 


connectivity  matrix 


with  binary  elements.  The  domain  of  node  1  (the  set  of  nodes  connected  to 


The  Initial  defense  distributions  known  to  DM.  are  d 


The  network  Is  subjected  to  K  waves  of  attack.  In  the  k-th  wave,  the 


1-th  node  Is  attacked  ^7  RVs.  DM^  knows  at  time  1  the  variables  j 

JeC.  as  well  as  the  probability  distribution  of  the  future  attack  against 


its  domain 


Based  on  this  Information  DM.  makes  the  £'.>llowlng  decisions 


The  neighbors  are  then  notified 


against  future  attacks 


of  the  value  of  d 


jeC.  according  to  a  strategy 


Find  the  allocation  of  d. 


discussed  in  the  previous  section 


This  process  Is  repeated  after  every  wave.  When  a  site  is  destroyed,  the 


defense  resources  associated  with  it  are  also  assumed  to  be 


A  Two-wave  Problem 


The  survivability  function  for  a  site  attacked  simultaneously  by  a  RVs 


To  maximize  the  above,  subject  to  £'^0,  6’'i0  and 


However,  in  practice  the  future  attack  Is  not  known  perfectly.  The  best 


information  one  can  have  about  It  Is  a  probabilistic  description  of  its  size 


where  L  is  the  number  of  possible  outcomes  of  the  size  of  the  future  (second 


wave  in  this  case)  attacks 


With  this  and  the  total  probability  theorem,  0  >1)  becomes,  for  an 


uncertain  second  wave 
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Note  that  now  the  optimization  problem  for  this  "present'-future"  allocation  Is 
slightly  more  complicated;  nevertheless  a  numerical  search  can  be  employed  to 
maximize  (9«7).  The  algorithm  Is  presented  In  the  next  section  together  with 
an  example. 


I 

I 
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E 


I 

I 

I 

I 

B 

I 

I 

I 
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10«  The  Dynamic  Allocation  Algorithm  and  an  Example. 

It  Is  assumed  that  all  the  DMs  know  the  probability  distribution  of  the 
equivalent  second  wave  attack  ("future  attack”)  against  the  sites  within  their 
domain.  Thus  DM^  knows 

VjE  Cl  (10.1) 

where  1”1»...»L  in  the  possible  future  attack  against  site  j  within  Its 

domain,  C^. 

> 

The  algorithm  Is  Illustrated  next  on  an  example. 

There  are  5  sites  connected  as  follows: 


The  available  defense  weapons  are 

6-(2  2  4  3  1)  (10.2) 

while  the  first  wave  of  attack  la 


o*-  (  1  2  2  4  1  ) 


(10.3) 
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The  Information  about  the  second  wave  Is 

P  {a"  -  (1  1  1  2  1)}=-| 

P  {a"  -  (1  1  2  1  1)}“^  ao.4i 

P  {o'  -  (1  2  1  1  1)}=-| 

The  parameters  entering  into  the  survivability  function  are  =  0.6  and  p^  =  0.9, 
as  before. 

Each  decision  maker  uses  in  this  algorithm  the  following  variables.  The 
average  first  wave  attack  in  the  domain  of  DM^  is  defined  as 


-  -  I 

i  J 


(10.5) 


where  n^  is  the  number  of  members  of  the  domain  of  DM^, 


«i  -  Z  "ij 


jeC^ 

and  are  the  (binary)  elements  of  the  network's  connection  matrix. 


(10.6) 


The  average  future  attack  o^  against  the  domain  of  DM,  is  a  random 
variable  that  can  take  the  values 


with  probability  p^.  This  follows  from  the  Joint  distribution  of  whlclr 

is  illustrated  in  (10.4).  It  is  assumed  that  all  DMs  have  the  same  probabilistic 
Inforttatlon  about  the  future.  However,  the  algorithm  is  not  affected  at  all  if 
each  DM  nses  a  different  distribution* 


Table  10.1  summarizes  the  information  available  to  each  DM  at  the  time  of 


the  first  wave 
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Decision 

Maker 

1 

Total  Attack 
1st  wave 
against 
domain  1 

Average  1st 
wave  attack 

“i 

Average  second  wave  attack  against 
domain  1  and  its  probability 

Pi 

1 

3 

1.5 

n 

2 

4 

2 

Si 

n 

3 

9 

2.25 

a 

m 

1.25;  i 

4 

7 

2.33 

1.33;  1 

1.33;  ^ 

B 

5 

5 

2.5 

Ml 

BBj 

Table  10.1  Information  available  to  DMs  In 
the  dynamic  example 

The  optimization  C9.7)  is  carried  out  for  the  resources  6^  using 
defined  in  (10.7).  Both  are  Illustrated  for  the  example  in  Table  10.1.  The 
Integer  solutions  resulting  from  optimizing  over  the  present/f uture  allocation 
(9.7)  by  each  DM^  as  specified  above  are  presented  in  Table  10.2. 


DM 

Available  resources 

Allocation  against 

Allocation  against 

1 

«1 

first  wave 

second  wave  4^ 

1 

2 

1 

1 

2 

2 

2 

0  * 

3 

4 

3 

1 

4 

3 

2 

1 

5 

1 

1 

0  ** 

Table  10.2  Preeent/future  allocations  by  Individual 
DMs  using  domain  Information 


*  The  decomposition  is  not  unique:  (1»1)  yields  same  expected  performance 


The  allocations  within  each  domain  against  the  first  wave  as  viewed  by  the 
various  DMs  are  presented  In  Table  10.3.  They  happen  to  be  the  same  for  each  of 


the  following  strategies:  T  (transfer),  PI  (preferential  defense  with  priorities), 
PO  (preferential  defense  with  optimization  and  CC  (completely  centralized).  The 
corresponding  performance  (average  survivability)  after  first  wave  Is  S(l)"  0.796. 


^'v^for  site 

As  vlewe^'v^,,^ 
by  DM 

1 

2 

1 

3 

4 

5 

Action 
by  DM 

1 

1 

1 

3 

i 

none 

2 

2 

3 

none 

3 

1 

2 

2 

3 

send  1  to  #4 

4 

2 

3 

1 

(expects  help) 

5 

2 

1 

none  j 

Attack 

1 

2 

2 

4 

1 

Initial  allocation 
against  first  wave 

1 

2 

3 

2 

1 

Final  allocation 

1 

2 

2 

3 

1 

Table  10.3  Defense  against  first  wave 
(strategies  T,  Pi,  PO,  C). 

The  local  (L)  strategy  which  effects  no  transfers  of  resources  yields 
performance  Sj^(l)~  0.773  after  the  first  wave. 

For  the  second  wave  the  analysis  Is  done  as  follows:  The  effective  resudLnlng 
weapons  after  the  first  wave  Is 

«"  -  S(l)  :  (10.8) 


This  yields  the  following  (noninteger)  values  of  resources  for  strategies 


At  this  point  advantage  Is  taken  of  the  fact  that  the  generalized  performance 


Index  can  be  used  also  for  noninteger  values  of  resources  to  obtain  the 


An  alternative  method  would  be  to  look  at  all  the  possible  outcomes  of  the  first 


wave;  since  this  Is  quite  time  consuming  the  simplified  approach  presented  above 


Is  considered  preferred.  The  following  realization  of  the  (random)  second  wave 


The  local  strategy  with  resources  (10.10)  against  attack  (10.11)  yields  second 


wave  survivability  S. (2)  *■  0.567  which  corresponds  to  an  overall  performance 


The  T  strategy  has  resources  (10.9)  against  attack  CIO. 11)  and  Its  optimal 


The  CC  strategy  has  the  following  (anticipated)  optimal  second  wave  allocation 


0.706  and  overall  performance 


which  yields  second  wave  performance  of  S 
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0.796  •  0.706  -  0.562 


(10.15) 


The  (anticipated)  actions  undertaken  by  stragegy  PI  against  the  second  wave 
are  shown  In  Table  10.4. 


^"v^^^^for  site 

As  vlewM'v.. 
by  DM 

1 

2 

3 

4 

5 

Action 
by  DM 

1 

1.592 

0 

(expects  help) 

2 

0 

.796 

none 

3 

2 

0 

.388 

0 

send  .408 
to  11 

4 

.592 

0 

D 

send  .796 
to  #5 

5 

0 

.796 

(expects  help) 

Attack 

2 

0 

3 

0 

■i 

- 

Initial  allocation 
against  second  wave 

mm 

0 

mm 

mm 

0 

Final  allocation 
against  second  wave 

1.204 

0 

.388 

0 

m 

Table  10.4  The  decisions  of  the  PI  strategy 
In  the  second  wave. 

(2) 

The  second  wave  performance  Is  Sp^'  0.665  and  the  overall  performance 

®P?  "  ^Pl^^^  Sl^^^  “  (10,16) 


The  PO  strsgety's  (anticipated)  decisions  are  shown  In  Table  10.5 


For  site 


As  viewed 


Action 


1.592 


(expects  help) 


2.388 


Attack 


Initial  allocation 
aealnst  second  wave 


Final  allocation 
against  second  wave  1.592 


send  .796 


send  .796 


1.291 


.796  I  (expects  help) 


Table  10.5  The  decisions  of  the  PO  strategy  In 
the  second  wave. 

The  second  wave  performance  Is  Sp_(2)  ■  0.695  and  the  overall  performance 


®P0^  -  SpQ(l)  Spq(2)  -  0.796  •  0.695  -  0.553 


(10.17) 


A  summary  of  the  overall  performances  of  the  five  strategies  for  the  dynamic 
problem  Is  given  In  Table  10.6. 


Strategy 


Performance  for  two-wave  problem  s' 


Table  10.6  Summary  of  results  for  the 
dynamic  problem  simulated. 
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i 


I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

I 

1 

1 


11.  Conclusion  and  Future  Work 

A  set  of  5  strategies  have  been  studied  for  the  static  problem  of  allocating 
defense  resources  against  a  single-wave  attack.  It  has  been  shown  that  a 
decentralized  preferential  strategy  with  local  optimization  can  yield  performance 
nearly  as  good  as  the  fully  centralized  strategy. 

The  feasibility  of  Implementing  decentralized  resource  allocation  strategies 
for  a  dynamic  problem  has  also  been  demonstrated.  The  performance  achieved  with 
the  decentralized  strategy  that  uses  an  optimization  method  for  reallocation  of 
the  resources  within  Its  domain  has  been  shown  to  be  very  close  to  the  performance 
of  the  fully  centralized  strategy. 

In  this  study  we  have  assumed  a  certain  Information  pattern,  which  dictates 
the  communication  requirements  and  the  available  data  base  for  each  decision  maker, 
and  developed  the  methodology  to  evaluate  the  performance  of  the  overall  system. 

The  next  natural  step  following  this  study  Is  the  following:  optimize  the 
connections  In  the  computer  network  (l.e.  among  the  decision  makers)  to  achieve 
maximum  performance  subject  to  a  total  cost  for  the  connections.  This  can  make 
the  transition  from  the  evaluation  to  the  design  stage  of  a  dedicated  computer 
network  whose  task  Is  to  control  a  BMD  environment.  As  a  by-product  of  such  a 
follow-on  study  one  can  obtain  the  relationship  between  overall  cost  and  the 
optimized  performance. 
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